A Comparative Error Analysis of Audio-Visual Source Localization

نویسندگان

  • Damien Kelly
  • François Pitié
  • Anil Kokaram
  • Frank Boland
چکیده

This paper examines the accuracy of audio-video based localization using multiple cameras and multi-microphones. Covariance mapping theory is used to determine the accuracy of audio and video based localization. Both modalities are compared in terms of their ability to provide accurate location estimates of a moving audio-visual source. Relatively, video is found to be significantly more accurate than audio. The problem of audio-video fusion is also examined. The fusion of audio and video location estimates is applied in the audio domain, the video domain and the positional domain. The accuracy of these three fusion strategies for 3D localization are examined from a theoretical basis. The best localization performance is found when fusion is applied in the positional domain. Fusing audio and video data in the video domain is found to exhibit the worst localization performance. This analysis is confirmed by measuring the accuracy of each fusion strategy in localizing a moving audio-visual source.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

راهکار جدید استخراج ویژگی مبتنی بر نمونه‌برداری فشرده در پردازش سیگنال‌های صوتی

In this paper, we present a Compressive Sampling (CS)-based feature extraction method for audio signals. In the proposed approach, the audio signal is firstly segmented by hamming windows and the Discrete Fourier Transform (DFT) of the samples is calculated within each frame. Then, the normalized values of the DFT coefficients of each frame are accumulated. At the next step, the second DFT is a...

متن کامل

A Novel Sound Localization Experiment for Mobile Audio Augmented Reality Applications

This paper describes a subjective experiment in progress to study human sound localization using mobile audio augmented reality systems. The experiment also serves to validate a new methodology for studying sound localization where the subject is outdoors and freely mobile, experiencing virtual sound objects corresponding to real visual objects. Subjects indicate the perceived location of a sta...

متن کامل

The impact of wind-generated bubble layer on matched field sound source localization in shallow water (Research Article)

This paper investigates the effect of the wind-generated bubble layer on the underwater sound source localization in the Persian Gulf shallow-water environment through computer simulation and the matched field processing technique. An underwater sound source of 2-10 kHz located at depths of 10, 45, and 75 m was considered at a distance of 4 km from a linear vertical receiver array. The estimati...

متن کامل

Comparing the Impact of Audio-Visual Input Enhancement on Collocation Learning in Traditional and Mobile Learning Contexts

: This study investigated the impact of audio-visual input enhancement teaching techniques on improving English as Foreign Language (EFL) learnersˈ collocation learning as well as their accuracy concerning collocation use in narrative writing. In addition, it compared the impact and efficiency of audio-visual input enhancement in two learning contexts, namely traditional and mo...

متن کامل

Neuromorphic Audio–Visual Sensor Fusion on a Sound-Localizing Robot

This paper presents the first robotic system featuring audio-visual (AV) sensor fusion with neuromorphic sensors. We combine a pair of silicon cochleae and a silicon retina on a robotic platform to allow the robot to learn sound localization through self motion and visual feedback, using an adaptive ITD-based sound localization algorithm. After training, the robot can localize sound sources (wh...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008